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ABSTRACT 

The addition of uridine nucleotide by the poly(U) 
polymerase (PUP) enzymes has a demonstrated 
impact on various classes of RNAs such as 
microRNAs (miRNAs), histone-encoding RNAs and 
messenger RNAs. Cid1 protein is a member of the 
PUP family. We solved the crystal structure of Cid1 
in complex with non-hydrolyzable UMPNPP and a 
short dinucleotide compound ApU. These structures 
revealed new residues involved in substrate/product 
stabilization. In particular, one of the three catalytic 
aspartate residues explains the RNA dependence of 
its PUP activity. Moreover, other residues such as 
residue N165 or the p-trapdoor are shown to be 
critical for Cid1 activity. We finally suggest that the 
length and sequence of Cid1 substrate RNA influ- 
ence the balance between Cidl's processive and 
distributive activities. We propose that particular 
processes regulated by PUPs require the enzymes 
to switch between the two types of activity as 
shown for the miRNA biogenesis where PUPs can 
either promote DICER cleavage via short U-tail or 
trigger miRNA degradation by adding longer 
poly(U) tail. The enzymatic properties of these 
enzymes may be critical for determining their 
particular function in vivo. 

INTRODUCTION 

Cytoplasmic messenger RNA (mRNA) homeostasis is a 
complex phenomenon in which the balance between 
mRNA stability and mRNA degradation is tightly moni- 
tored. Eukaryotic species, from plants to mammals, rely 
heavily on the poly(A) tail to control the mRNA degrad- 
ation process (1-2). Recently, a long-known family of 
proteins, named poly(U) polymerases (PUPs), was 
shown to be an unexpected player in mRNA homeostasis 
in various eukaryotes (3-6). The PUPs add poly(U) tails 



to mRNAs, which appear to modulate the rates and/or the 
directionality of the mRNA degradation process (4,7-9). 
Furthermore, members of the PUP family also regulate 
the production of histone mRNAs, microRNAs, siRNAs 
or U6 snRNA in the nucleus (6,10-17). 

The PUP enzymes contain a nucleotidyl transferase 
domain, which has been characterized structurally 
(18-23). These structures show that the nucleotide triphos- 
phate and the substrate RNA are bound in a space 
between the catalytic domain and the central domain 
(21,24). The PUP Cidl protein was identified in 
Schizosaccharomyces pombe and was shown to have the 
capacity to use both uridine triphosphate (UTP) and ad- 
enosine triphosphate (ATP) as substrates for polymeriza- 
tion in vitro, while apparently being specific for UTP 
in vivo (25-26). This is surprising, as other members of 
the PUP family were depicted to be highly specific for 
UTP (18-20,27-28). Residue H336 found in the nucleotide 
recognition motif (NRM) loop has been shown to 
be mainly responsible for determining the specificity of 
the nucleotide recognized in vitro and in vivo (21-23). 
Additional charged residues were shown to be involved 
in substrate RNA association, although their exact 
function was not entirely characterized (21-22). 

To further understand the mechanism behind RNA rec- 
ognition and elongation, we determined the structure of 
Cidl bound to a non-hydrolyzable nucleotide UMPNPP 
and bound to its minimal pseudo-product ApU. These 
structures revealed new key residues for the substrate/ 
product recognition. In particular, one of the three cata- 
lytic aspartate residues interacts with the 2'-OH group of 
the pseudo-product-bound dinucleotide, thus explaining 
the RNA specificity displayed by this class of enzymes. 
Moreover, residue N165 interacts with the adenosine 
base in our pseudo-product-bound crystal structure. This 
interaction is essential for the enzymatic reaction. Finally, 
we demonstrate that the polymerizing capacities of Cidl 
are modulated by a highly flexible loop named the (3- 
trapdoor (residues 310-322) and by the residue K144 
when UTP is used as a substrate (22). Our study thus 
highlights the structural basis behind several unique 
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properties of the PUP enzyme family: properties mediated 
by a combination of specific residues and conserved sec- 
ondary structures. We speculate that the specific type of 
activity that PUP enzymes display may determine their 
involvement in particular biochemical pathways and 
require factors to regulate their polymerizing activity. 



MATERIALS AND METHODS 

Expression and purification 

Schizosaccharomyces pombe Cidl protein (residues 40-377) 
was expressed and purified as described previously (21). 
Briefly, Cidl was expressed in Escherichia coli BL21 Star 
cells (Novagen) as a His 9 -MBP fusion protein and purified 
using HiTrap FF crude column (GE Healthcare). After 
cleavage with the Tobacco Etch Virus protease (ratio of 
1/50) at 16°C, the sample was reloaded onto the HiTrap 
column to remove the tag, the protease and the contamin- 
ants. The purified protein was finally applied on a Superdex 
200 column (GE Healthcare), in a buffer containing 20 mM 
Hepes (pH 7.5), 150mM NaCl, ImM (3-mercaptoethanol 
and 10% (v/v) glycerol. Fractions containing the protein 
were pooled and concentrated (30000 MWCO Amicon) 
to lOmg/ml and stored at — 80°C. All protein samples 
eluted from the gel filtration column as monomers. The 
same expression and purification procedure was performed 
for all the Cidl mutants. 

Mutagenesis 

Cidl point mutants were prepared by site-directed muta- 
genesis using the Quick-Change mutagenesis kit from 
Stratagene according to manufacturer's instructions. The 
Cidl A310-322 deletion mutant was prepared by Nested- 
polymerase chain reaction. All the constructs were verified 
by DNA sequencing. The oligonucleotides used for the 
mutations are shown in Supplementary Table SI. 

Protein crystallization and X-ray data collection 

D160A mutant protein was used at lOmg/ml. Crystals 
were produced using the sitting-drop vapor diffusion tech- 
nique at 18°C, with a reservoir solution containing 0.1 M 
imidazole/MES (equal ratio) (pH 6.1), 12-20% glycerol, 
6-10% polyethylene glycol (PEG) 4000, 126 mM of a 
halogen salt (Nal, NaBr, NaF in equal ratio) and 
10 mM TCEP [tris(2-carboxyethyl)phosphine] as an 
additive. Crystals appeared in 2 days and grew to a 
maximum size of around 0.25 x 0.25 x 0.2 mm in 1 week. 
Crystals of D160A mutant were soaked for 16 h with 
2.5 mM UMPNPP and UTP or with 2.5 mM of ApU 
and ATP, in both cases supplemented with 5mM of 
MgCl 2 . Crystals were passed into the crystallization 
solution supplemented with 10% glycerol before flash 
freezing in liquid nitrogen. 

Diffraction data for the UMPNPP-soaked and 
ApU-soaked protein crystals were collected using the 
Beamline ID 14^1 at the European Synchrotron 
Radiation Facility (ESRF, Grenoble). Complete data 
sets for UMPNPP- and ApU-soaked crystals were 
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Table 1. Data collection and refinement statistics 



Data collection 



Crystal 


UMPNPP-soaked 


ApU-soaked 


Wavelength (A) 


Cidl 


Cidl 


0.9394 


0.9394 


Temperature (K) 


100 


100 


Maximum resolution (A) 


45.0-1.90 


45.3-1.94 


Space group 


P2, 


P2, 


Unit cell dimensions (A) 


a = 53.8, 


a = 54.0, 




b = 77.3, 


b = 77.1, 




c = 82.1, 


c = 81.8, 




a = y = 90.0, 


oc = y = 90.0, 




R = 90.9 


P = 90.7 


Number of observations 


260 839 


246 855 


Number of unique reflections 


52 778 


49 569 


Redundancy 


5.0 


5.0 


Data completeness (%) 


99.5 (99.7) 


99.6 (99.2) 


I/a(I) 


28.47 (13.2) 


13.14 (2.3) 


^raeas 


4.4 (16.7) 


9.3 (87.8) 


Refinement 






Resolution (A) 


19.8-1.90 


45.3-1.94 


Number of atoms 


5688 


5569 


Protein 


5112 


5129 


Ligand 


58 (UMPNPP) 


78 (ApU) 


Magnesium ions 


4 


3 


Brome ions 


3 


3 


Water molecules 


511 


356 


Rwork (%) 


17.94 


17.92 


Rfree (%) 


20.81 


22.57 


RMSD from ideaj geometry 






Bond lengths (A) 


0.008 


0.008 


Bond angles (degrees) 


1.082 


1.077 


Mean B (A 2 ) 






Protein chain 


22.6 


33.9 


Nucleotide/ions 


26.1 


35.2 


Water 


30.6 


41.3 


Residues in favored region of 


97.52 


96.72 


the Ramachandran plot (%) 






Residues in allowed region of 


2.15 


2.63 



the Ramachandran plot (%) 



collected to 1.9 and 1.94 A resolution, respectively, and 
the statistics are reported in Table I. 

Structure determination and refinement 

The structures of the D160A mutant in complex with 
UMPNPP or in complex with ApU were determined by 
molecular replacement with the Cidl protein structure as a 
search model (PDB code 4EP7). The data sets were 
indexed with the XDS package (29). Phasing was done 
using the program Phaser from the CCP4 package (30) 
and the models were refined using the routine phenix. re- 
fine from the Phenix program (31). A large unidentified 
density was visible in both 2F Q - F c and the F Q - F c 
electron density maps where an UMPNPP or an ApU 
molecule was built (Supplementary Figure SI A and B). 
Atomic coordinates for the UMPNPP and ApU molecules 
were taken from the HIC-Up database (32) and fitted in 
the empty electron density map using the program COOT 
(33). The final Cidl/UMPNPP and Cidl/ApU models 
were refined to a resolution of 1.9 and 1.94 A, respectively. 
Both crystal structures contain two molecules of Cidl in 
the asymmetric unit. The Cidl/UMPNPP structure 
consists of residues 38-377 from the Cidl protein, 2 
UMPNPP compounds, 4 magnesium ions and 3 bromide 
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ions present in the crystallization condition. The Cidl/ 
ApU model contains the same Cidl sequence (residues 
38-377 for molecule A and residues 39-377 for 
molecule B), 2 ApU dinucleotides, 3 magnesium ions 
and 3 bromide ions. For both models, loops containing 
residues 109-115 and 309-322 were not visible in our 
electron density maps. 

Poly(U) polymerase assays 

RNA primers (Ui 5 , A 15 or ApU) were labeled using T4 
polynucleotide kinase (NEB) and y- 32 P-ATP using manu- 
facturer's protocol. Labeled RNA substrates (10 pmoles) 
were incubated with 5 or 15 pmoles of Cidl proteins 
[wild-type (WT) or mutant] in 25 ul of lOmM Tris-Cl 
(pH 7.9) containing 50mM NaCl, lOmM MgCl 2 , ImM 
DTT and 0.5 mM UTP (or ATP when indicated) for 
40min at 37°C. Reactions were stopped with an equal 
volume of lOOmM EDTA and 2% sodium dodecyl 
sulfate. Extended primers were extracted with phenol- 
chloroform, precipitated with ethanol and resuspended 
in 90% formamide loading dye. Products were denatured 
at 95°C for 2min and separated by gel electrophoresis in a 
12% (or 25% for the ApU substrate) polyacrylamide/7 M 
urea gel. 

Electrophoretic mobility shift assay 

The indicated amount of proteins was mixed with the 
labeled RNAs in 20 mM Hepes (pH 7.5) containing 150 
mM NaCl, 5 mM MgC12, 1 mM P-mercaptoethanol, 10% 
glycerol and incubated at 37°C for lOmin. Following 
incubation, 2ul of loading dye (0.1% bromophenol blue, 
0.1% xylene cyanol, 180 mM Tris-boric acid and 50% 
glycerol) was added, and the samples were immediately 
loaded on a 10% native polyacrylamide gel in 0.5 x TBE 
buffer (45 mM Tris-boric acid, 1 mM EDTA). The gel was 
then run at 300 V for 120min at 4°C and exposed in a 
cassette with an X-ray film (Fujifilm) for 30 to 45min. 
Following quantification with ImageQuant software (GE 
Healthcare), affinity constants were calculated using the 
non-linear regression with one site-specific binding option 
from GraphPad Prism 6 program (GraphPad). 

For comparing directly the RNA binding efficiency of 
various mutated Cidl proteins, we used a unique 
concentration of protein (WT or mutant) and measured 
complex formation with a given probe (U15 or A15). 
Then, we quantified and normalized the amount of 
complex obtained with a given mutant protein by the 
one obtained with the WT enzyme for the same probe. 
Protein concentrations were set to 1.5 uM for the U15 
probe and 11 uM for the A15 RNA. We used these two 
fixed concentrations of protein to maximize complex 
formation based on WT Cidl Kd value for a given 
probe and to take into account the lower solubility 
observed with mutant proteins K144A and F332A. After 
loading and running the sample as indicated previously, 
we quantified our signal, normalized it against WT signal 
and plotted it as histogram. Individual binding reactions 
were performed three times in triplicates. 



RESULTS AND DISCUSSION 

Overall structure of the Cidl/UMPNPP and Cidl/ ApU 
complexes 

We mutated one residue from the catalytic triad (residue 
D160) in an attempt to stabilize the UMPNPP- or the 
ApU-bound Cidl complexes as successfully done in the 
poly(A) polymerase (PAP)/ATP/RNA ternary complex 
structure (24). Here, we report the atomic structures of 
the D160A Cidl mutant in complex with the non- 
hydrolyzable UMPNPP and the D160A Cidl mutant 
in complex with a minimal pseudo-product (ApU) 
(Figure 1A and B; see X-ray statistics in Table I). 
Both ligands were built using unbiased Fo-Fc Fourier dif- 
ference electron density map calculated with the protein 
model only after a round of simulated annealing 
(Supplementary Figure SI). 

UMPNPP mode of binding is similar to our Cidl/UTP 
complex (21) confirming that D160A mutation does not 
induce any conformational changes in the overall protein 
fold or in the substrate nucleotide recognition by the 
protein. The only difference is the absence of the first mag- 
nesium ion necessary for the nucleolytic attack, as previ- 
ously observed in the PAP/ATP/RNA crystal structure 
(Figure 1A) (24). 

The bound ApU molecule should be considered as 
a mimic of the post-catalysis reaction state as defined 
by Stagno et al. (27). As such, ApU is considered as a 
pseudo-product of the Cidl -mediated reaction. The 
pseudo-product ApU is representative of the in vivo situ- 
ation encountered by Cidl protein as, in S. pombe, Cidl 
natural mRNA substrates are polyadenylated (Figure IB) 
(26). In the ApU-bound Cidl structure, the uridine nu- 
cleotide is recognized as in the UMPNPP and the UTP- 
bound Cidl structures (21-23) (Figure 1C). The adenosine 
base of the bound ApU molecule is stacked on the uridine 
base, a situation reminiscent of the TUT4/UpU structure 
(Figure 1C) (27). Several Cidl residues are involved in the 
stabilization of the adenosine nucleotide. First, its ribose 
2'-OH group establishes a direct hydrogen bond with the 
catalytic residue D103 (Figures 1C and D). Although Cidl 
was previously classified as an RNA-specific nucleotidyl 
transferase, the basis for such specificity was not described 
(8). We suggest that the catalytic residue D103 stabilizes 
the proper type of substrate near the catalytic site by de- 
tecting the presence of a 2'-hydroxyl group (Figure ID). 
This mechanism of RNA detection is probably shared 
across the entire family, as a similar interaction is found 
in the TUT4/UpU crystal structure (27). Second, residue 
D103 and the ApU molecule have another water-mediated 
interaction with the N3 position of the adenine, likely used 
to stabilize the incoming RNA substrate before the 
nucleotidyl transfer reaction (Figure 1C and D). The 
adenine is further stabilized via additional water- 
mediated interaction between its N6 position and residue 
E333 (Figure 1C and D). None of these water-mediated 
interactions confer nucleotide specificity on the 3'-end of 
the substrate RNA, as expected for a template-independ- 
ent polymerase (8). In the ApU complex structure, the Nl 
position of the bound adenine is recognized via a specific 
hydrogen bond with the OD1 position of residue N 165. In 



Nucleic Acids Research, 2014, Vol. 42, No. 5 3375 




Figure 1. Crystal structure of Cidl in complex with its minimal pseudo-product. (A) Cartoon representation of D160A Cidl bound to the non- 
hydrolyzable UMPNPP. The UMPNPP, the catalytic aspartic acids and D160A residue are shown as ball and sticks and colored according to atom 
type (carbon, yellow or cyan; nitrogen, blue; oxygen, red; phosphate, orange). (B) Cartoon representation of D160A Cidl bound to ApU. The ApU 
molecules are shown and colored as in A. The catalytic (CAT) and central (CD) domains of Cidl are colored in salmon and green, respectively, with 
the 'ratchet' helix four colored in yellow in A and B. (C) Side view of the active site of the Cidl/UTP and D160A/ApU superposed structures. Cid/ 
UTP atomic model is colored in gray. Residues from the NRM loop are shown as ball and sticks and colored as in A (carbon atom in magenta). 
Hydrogen bonds within 2.4-3.6 A distance are shown as black dash lines for adenine stabilization and as red dash lines for uridine stabilization. 
Amino acids from the Cidl/UTP structure are shown with thin stick radius compared with the one of the Cidl/ApU model. (D) Top view of the 
D160A showing ApU interactions. Atoms, residues and hydrogen bonds are colored as C. Magnesium ions are illustrated as gray spheres in A and 
C. All the structural panels here and in the following figures were generated with PyMol (34). 



Trypanosoma brucei TUT4/UpU structure, the equivalent 
residue is an arginine (R141), which cannot contact the 
uracil base because it forms a salt bridge with residue 
E300 (E333 in Cidl) (27). However, if we superpose the 
TUT4/UpU and Cidl /ApU structures, the N3 atom of the 
uracil base in the trypanosomal structure is located at 
nearly the same place as the Nl atom of the adenosine 
in our structure (Figure 2A). Therefore, we suggest that 
N 165 is a key residue for binding to the 3'-end of the RNA 
substrate either before or after the Cidl uridylation 
reaction. This proposal is valid whether this substrate 
RNA contains a 3'-end adenosine nucleotide, as 
expected for the native poly(A)-containing substrates in 
S. pombe, or whether the RNA went through at least 
one round of polyuridylation (Figure 2A). Our proposal 
can be extrapolated to any RNA 3'-end base, although it 
will require the nucleotide tautomeric form with a 
hydrogen-bonded Nl atom to establish a proper bond 
with N165 OD1 position. Finally, the TUT/PUP super- 
position also shows that residue R139 in Cidl is not in 



contact with the pseudo-product ApU (Figure 2B). We 
presume that the two enzyme groups may have evolved 
toward slightly different RNA recognition/stabilization 
modes, most likely reflecting their specific enzymatic 
properties. 

RNA binding properties 

Cidl has been classified as a non-canonical template- 
independent nucleotidyl transferase (3,7-8,21-23,35). 
Previous studies have also shown that the overall 
activity of Cidl is unexpectedly high when presented 
with Uis (26). To further understand the Cidl enzymatic 
properties, we generated various Cidl mutants driven by 
the analysis of our Cidl /ApU crystal structure. We then 
measured their binding properties for two different sub- 
strates (A! 5 and Ui 5 ) in the absence of any polymerizing 
activity to avoid interference from the newly synthesized 
tail (Figure 3 and Supplementary Figure S2). 

We first compared the capacities of the WT protein 
and of our mutant D160A to bind to the two selected 
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Figure 2. Structural comparison between the Cidl/ApU and the 77?TUT4/UpU complex structures. (A) Close view of the NRM loop 77)TUT4/UpU 
(PDB code 2Q0G) and Cidl/ApU showing related active site pocket architecture. r/>TUT4/UpU structure is shown in cartoon representation and 
colored in violet. The main interacting amino acids and the UpU molecule are shown as ball and stick model colored in violet or green, respectively. 
Cidl/ApU model, residues and atoms are shown and colored as in Figure 1. (B) View from the top of the active site highlighting the differential 
position of the loop between P-sheets four and five. This loop contains residue R139 (R121 in 77?TUT4). 77/TUT4 residues are labeled in italic and 
the radius of their ball and stick representations is reduced compared with the one used for Cidl residues. Magnesium ions are illustrated as gray 
spheres in all panels. The black dash lines represent hydrogen bonds within 2.4-3.6 A distance. 



substrates — mimics of a short poly(A) tail (A 15 ) or a newly 
added U-tail (U15; Figure 3A and B; Supplementary 
Figure S2A and B). The affinity constant calculated 
between the Cidl WT and the U 15 probe was ~1 and 
20 uM with the A 15 probe (Figure 3C). These values 
were identical for the D160A mutant, reinforcing the 
fact that this mutation does not alter the enzyme's RNA 
binding capacity. Overall, we observe a 20-fold difference 
between the affinity constants of the various tested Cidl 
proteins for a 15-mer homouridine as compared with the 
A 15 (Figure 3C). Cidl's intrinsically higher affinity for 
homouridine polymer probably explains why Cidl 
exhibits a more efficient polymerizing activity on Ut 5 
RNA (26). 

We also mutated residue N165 into an alanine (or an 
aspartate) and assessed the mutated Cidl enzyme's 
binding capacity (Supplementary Figure S2C and D). 
The affinity constant for the N165A mutant did not 
differ from that of the WT protein (Figure 3C). These 
values are comparable or slightly lower than the 
reported values for nucleotidyl transferase enzymes 
(23,27,36). The binding experiments agree with our obser- 
vation that both A and U nucleotide would be forming a 
stable hydrogen bond with N165 residue (Figures 2A and 
3C). We then tested residue R139, which is found in the 
loop between P-strands 4 and 5 (Figure 2B). In the 
J&TUT4, the equivalent amino acid R121 is involved in 
the primer stabilization as well as in the catalytic activity 
(Figure 2B) (19,27). The association between the two 
RNA substrates and the R139A mutant is similar to 
that of the WT protein (Figure 3D). These binding 
assays corroborate our structural data, which did not 
show any interaction with R139 residue (Figure 2B). We 
concluded that this residue is not important on its own for 
the recognition of the substrate RNA in contrast to 
7~&TUT4 (Figure 3D). Furthermore, it has been suggested 



that the P-sheet structure on top of the active site, named 
P-trapdoor, plays a role during UTP exchange, although 
its role in RNA association has not been assessed (22). We 
could not observe any significant difference between the 
affinity constants measured with Cidl truncation of 
residues 310-322 (A3 10-322) for both RNA substrates 
suggesting that the P-trapdoor does not determine the 
specificity of the associated RNA per se (Figure 3C). 

Several residues that have been shown to be important 
for the Cidl activity such as F332, K144 or H336 were 
finally tested for their potential effect on the RNA binding 
properties. Strikingly, the F332 mutant had a reduced 
capacity for binding RNAs (Figure 3D). We previously 
proposed that F332 was critical for the UTP clamping 
mechanism mediated by the ratchet helix 4 (21). We 
suggest that the importance of the domain movement 
during the nucleotide transfer likely extends to RNA 
template trapping mechanism as well (Figure 3D) (21). 
We then looked into the RNA binding capacity of the 
H336A mutant protein for either of the RNA substrates, 
which was unimpaired, thereby confirming that H336 is 
only important for the UTP selectivity (Figure 3D) (21). 
When we tested the RNA binding capacity of the K144A 
mutant protein, we observed that its association with A 15 
was strongly reduced, whereas it was unchanged for a 
15-mer U stretch (Figure 3D). Therefore, K144 residue 
seems important for the stabilization of our short 
incoming substrate RNA. These results are slightly differ- 
ent from the data published by Gilbert's laboratory, 
possibly due to the differences in the RNA used for the 
binding studies (a 40-mer RNA in their study) (22). The 
poly(A) polymerase equivalent residue K145 is found to 
contact the 2'-OH group of the nucleotide -2 in the PAP/ 
ATP/RNA crystal structure (24). Therefore, the Cidl 
protein may use a substrate stabilization strategy closer 
to the one of the PAP enzymes. Altogether, it appears 
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that Cidl enzyme is highly specialized with residues 
critical for the nucleotide triphosphate association 
(H336), residues specialized for the RNA primer associ- 
ation (K144) and residues involved in the nucleotidyl 
transferase enzymatic cycle ((3-trapdoor, D103, N165 or 
F332). 
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Figure 3. Characterization of U15 and A 15 RNA binding properties of 
WT and mutant Cidl protein by EMSA. (A) EMSA showing the 
interaction of Cidl with an A| 5 RNA. (B) Same experiments with a 
U15 RNA. In A and B, the individual reactions contain either no 
protein or the indicated amounts. (C) RNA binding curves and 
calculated Kj from quantified EMSA experiments performed with 
Cidl WT, D160A. N165A and A3 10-322 mutants for both A, 5 



Polymerization activity of Cidl mutants 

To complete our RNA binding study, we measured the 
polymerization activities of the same set of mutants. As 
we observed significant differences between Cidl affinity 
for a U-tail as compared with an A-tail, we looked at the 
polymerizing activity in the presence of either UTP or 
ATP. The N165A mutant is unable to add more than 
one nucleotide per substrate regardless of the provided 
nucleotide triphosphate (UTP or ATP) even though its 
RNA binding capacity is unimpaired, at least for the 15- 
mer primer (Figures 3C, 4A and B). If N165 residue is 
changed into a charged residue (N165D), the enzyme is 
able to add up to 3^1 uridines or up to 20 adenines to the 
15-nt RNA template but is still almost unable to use the 
ApU substrate (Figure 4, mutant N165D). Overall, these 
experiments demonstrate the critical role of N 165 residue 
in the polymerizing reaction. Being located at the begin- 
ning of helix 4, N165 is likely important during the swivel 
motion occurring throughout the catalytic cycle of the 
enzyme (21). The rotation of the catalytic domain 
around the helix 4 can open the space between the 
domains. We suggest that N165 residue is involved in 
pulling on the RNA primer after the addition of the 
UMP. On displacement of the RNA primer, the UTP 
binding cavity is emptied allowing a new UTP molecule 
to take its place, and a new cycle to start. 

Another region of interest near the NRM loop is the P- 
trapdoor (22). Deletion of the (3-trapdoor residues does 
not affect the overall PUP activity of the enzyme for 
either of the RNA primers as long as we use UTP as a 
substrate (Figure 4A and C). We can observe the typical 
biphasic profile with, on one hand, short products of 7-9 
added nucleotides corresponding to a distributive activity 
and, on the other hand, highly elongated products that did 
not enter the gel (Figure 4A and B). However, when we 
look at activity of the A3 10-322 mutant protein in the 
presence of ATP, we do not observe any larger products 
suggesting that Cidl without the P-trapdoor is almost ex- 
clusively a distributive PAP enzyme (Figure 4B and D). 
This is even more striking with the short substrate ApU 
(Figure 4D). Because these observed differences between 
PAP and PUP activities cannot be linked to a difference in 
the RNA binding properties of the truncated protein 
(Figure 3C; Supplementary Figure S2E and F), we 



Figure 3. Continued 

and U15. Individual binding experiments were repeated at least three 
times and representative EMSA experiments are shown in the 
Supplementary Figure S2. (D) Histogram representation of the inter- 
action measured between various Cidl mutants with either an U 15 or 
an A [5 RNA probe. Bound fractions correspond to the amount of 
formed complex and were normalized by the one observed between 
Cidl WT protein and either the U 15 or the A 15 probe. Experiments 
were repeated at least three times and done in triplicate. 
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Figure 4. Poly(U) and poly(A) polymerization assays with two different RNA probes. (A) Polymerization assay in the presence of UTP with an A [5 
primer. Lane 1 shows the RNA primer after incubation without enzyme. The RNA after elongation with WT Cidl (lanes 2-3), N165A (lanes 4-5), 
N165D (lanes 6-7), A310-322 deleted protein (lanes 8-9) and K144A mutants are loaded in the respective lanes. (B) Polymerization assay in the 
presence of ATP with an A 15 as substrate. Similar mutants were used and reactions were loaded as in A. (C) Polymerization assay with UTP and the 
substrate ApU. The lane 1 shows the ApU substrate after incubation without enzyme. The polymerizing reaction in the presence of WT Cidl, 
N165A, N165D and A310-322 mutant proteins is shown in lanes 2-3, 4-5, 6-7 and 9-10, respectively. (D) Polymerization assays with ATP and an 
ApU substrate. Similar mutants were used and reactions were loaded as in C. 



conclude that the P-trapdoor has an important role during 
the catalytic cycle. This may be to favor NTP exchange 
after each polymerization cycle by promoting domain 
movements (22). 

Finally, we tested the polymerizing activity of the 
K144A mutant. This single-point mutant has a PUP 
activity comparable with the WT protein, but its PAP 
activity is drastically reduced as demonstrated by the 
almost complete absence of extended products (compare 
Figure 4A and B). As observed for the A310-322 deleted 
protein, its activity is now almost exclusively distributive 



when presented with ATP (Figure 4A and B). As the 
K144A mutant has an impaired A 15 RNA association, 
we propose that the change in the polymerizing profile is 
linked with the poly(A) RNA binding capacities of the 
protein. 



CONCLUSIONS 

Our atomic models provide missing information regarding 
the association of RNAs with the uridylating enzyme 
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Cidl. We identify multiple determinants behind the RNA 
binding properties of Cidl and decipher the roles played 
by several residues or protein fragments during Cidl's 
polymerization activity. When the 3'-end of the template 
RNA is not a uridine, Cidl has a low RNA binding 
affinity and therefore a highly distributive activity reflect- 
ing its classification as a template-independent nucleotidyl 
transferase (8). Our binding tests show that on addition of 
6-7 uridine nucleotides, the enzyme's affinity for the 3'- 
end of its template becomes at least 20 times higher as 
shown by our binding tests. Following this switch in its 
overall affinity, Cidl's activity becomes highly processive. 
We show that residue K144 is involved in this switch of 
activity. The Cidl catalytic cycle is complex and its com- 
pletion critically depends on several key residues: (i) the (3- 
trapdoor controls at least partially the UTP exchange 
after the reaction; (ii) N165 residue, which is hydrogen 
bonded to the 3'-penultimate nucleotide in our atomic 
model, may be involved in the displacement of the 
elongated RNA product after UMP addition. Finally, 
the overall catalytic cycle depends on the previously 
proposed domain movement around helix 4 and residue 
F332 to proceed (21). 

Until now, the length of the characterized poly(U) tail 
due to PUP activities in vivo is generally short (between 1 
and 5nt) (4,5,37,38). We propose that Cidl enzymatic 
activity is influenced by its higher affinity for the 
homouridine-containing tails. To prevent Cidl from 
generating long U-tails that can trigger RNA degradation 
by exonucleases like Dis3L2 (9,39), we suggest that factors 
are used to control the local activity, like the targeting 
factor Lin28 (11,40). 

Lastly, as shown by the differential use of particular 
protein residues like residue R139 or K144, TUT and 
PUP enzymes have evolved differently (19,21). It has 
recently been proposed that DSS1 protein, a uridine- 
specific exonuclease, plays a critical role during the guide 
RNA maturation process in trypanosome, a process 
where RET1 TUT enzyme is critical (41,42). As our 
present results shed new light on the properties of the 
polyuridylating enzyme Cidl, it will be exciting to 
see whether a similar switch exists in the RET1 enzyme 
(43-45). 
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